Pragmatic-Pedagogic Value Alignment

نویسندگان

  • Jaime F. Fisac
  • Monica A. Gates
  • Jessica B. Hamrick
  • Chang Liu
  • Dylan Hadfield-Menell
  • Malayandi Palaniappan
  • Dhruv Malik
  • S. Shankar Sastry
  • Thomas L. Griffiths
  • Anca D. Dragan
چکیده

For an autonomous system to provide value (e.g., to customers, designers, or society at large) it must have a reliable method to determine the intended goal. This is the essence of the value-alignment problem: ensuring that the objectives of an autonomous system match those of its human users. In robotics, value alignment is crucial to the design of collaborative robots that can integrate into human workflows, successfully learning and adapting to the objectives of their users as they go. We argue that a meaningful solution to the value-alignment problem will combine multiagent decision theory with rich mathematical models of human cognition, enabling robots to tap into people’s natural collaborative capabilities. We present a solution to the cooperative inverse reinforcement learning (CIRL) dynamic game using wellestablished models of decision making and theory of mind from cognitive science. The solution accounts for two crucial aspects of collaborative value alignment: that the human will not plan her actions in isolation, but will reason pedagogically about how the robot might learn from them; and that the robot should anticipate this and interpret the human’s actions pragmatically. To our knowledge, this constitutes the first equilibrium analysis of value alignment grounded in an empirically validated cognitive model of the human.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Application of the ABS LX Algorithm to Multiple Sequence Alignment

We present an application of ABS algorithms for multiple sequence alignment (MSA). The Markov decision process (MDP) based model leads to a linear programming problem (LPP), whose solution is linked to a suggested alignment. The important features of our work include the facility of alignment of multiple sequences simultaneously and no limit for the length of the sequences. Our goal here is to ...

متن کامل

Strategic approaches to digital libraries and virtual learning environments (VLEs)

Purpose – Argues that the successful introduction of digital libraries in the 1990s has important lessons for the successful implementation of e-learning strategies. Design/methodology/approach – An opinion piece based on current and recent trends in digital library and e-learning development. Findings – Pragmatic information strategies have important parallels with potentially effective strate...

متن کامل

A Pragmatic Solution to the Value Problem of Knowledge

We value possessing knowledge more than true belief. Both someone with knowledge and someone with a true belief possess the correct answer to a question. Why is knowledge more valuable than true belief if both contain the correct answer? I examine the philosophy of American pragmatist John Dewey and then I offer a novel solution to this question often called the value problem of knowledge. I pr...

متن کامل

A Devolved Ontology Model for the Pragmatic Web

Devolved ontology is an approach to ontology modelling and (co-) evolution which was developed in connection with agile partnerships. Inviting parallels between agile partnerships and the context of the Pragmatic Web suggest that this has potential value in realising the vision of the Pragmatic Web [SMD06]. This is especially clear in their respective uses of ontologies and in particular the de...

متن کامل

Pragmatic Alignment on Social Support Type in Health Forum Conversations

Linguistic alignment, such as lexical and syntactic alignment, is a universal phenomenon influencing dialogue participants in online conversations. While adaptation can occur at lexical, syntactic and pragmatic levels, relationships between alignments at multiple levels are neither theoretically nor empirically well understood. In this study, we find that community members show pragmatic alignm...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1707.06354  شماره 

صفحات  -

تاریخ انتشار 2017